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Abstract 

In this work we extend the setting of communication without power constraint, proposed by Poltyrev, to fast 
fading channels with channel state information (CSI) at the receiver. The optimal codewords density, actually the 
optimal normalized log density (NLD), is considered. Poltyrev's capacity for this channel is the highest achievable 
NLD, at possibly large block length, that guarantees a vanishing error probability. For a given finite block length 
' n and a fixed error probability e, there is a gap between the highest achievable NLD and Poltyrev's capacity. 

■ As in other channels, this gap asymptotically vanishes as y^Q^(e)^ where Q is the standard complementary 

Gaussian c.d.f. and V is the channel dispersion. This dispersion, derived in the paper, equals the dispersion of 
the power constraint fast fading channel at the high SNR regime. Connections to the error exponents of the finite 
] constellations average/peak power constraint channel are also discussed. 

I. Introduction 



(N 
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' Wireless communication channels are traditionally modeled as fading channels, where the transmitted 
signal is multiplied by a fading process and observed with additive white Gaussian noise (AWGN). In a 
fast fading channel the fading process is composed of fading coefficients, modeled as independent and 
identically distributed (i.i.d.) random variables. This is a reasonable model for many practical wireless 
communication systems, such as systems that use a (pseudo) random interleaver between the transmitted 
digital symbols (e.g. BICM techniques) over, e.g., a frequency selective wireless channel. Here we will 
IT) assume that a perfect knowledge of the channel state information (the fading coefficients) is available at 
\Q the receiver. 

O ■ Classical coding problems over the fading channels often include a peak or an average power restriction 
^ . of the transmitted signal. Without power constraint the capacity of the channel is not limited, since we 
can choose an infinite number of codewords to be arbitrarily far apart from each other, and hence get an 
arbitrarily small error probability and infinite rate. Nevertheless, coded modulation methods ignore the 
rS power constraint by designing infinite constellations (IC), and then taking only a subset of codewords 
c3 which are included in some "shaping region" to get a finite constellation (FC) that holds the power 
constraint. Hence, IC is a very convenient framework for designing codes. 

Poltyrev studied in [4J the IC performance over the AWGN without power constraint. He defined the 
density (the average number of codewords per unit volume) and the normalized log density (NLD) of 
the IC, in analogy to the number of codewords and the communication rate in the power constraint 
model, respectively. He showed that the highest achievable NLD over the unconstrained AWGN channel, 
with arbitrarily small error probability, is limited by a maximal NLD, sometimes termed the 'Poltyrev's 
capacity'. He also derived an exact term for the maximal NLD and error exponent bounds using random 
coding and sphere packing techniques, for any NLD below the capacity. 

In classical channel coding problems, the capacity gives the maximal achievable communication rate 
when arbitrarily small error probability is required (and arbitrary large codeword length n is permitted). 
The error exponent provides the exponential rate of convergence (with n) in which the error probability 
goes to zero, for any fixed rate below the capacity. Another interesting question is: for a fixed error 
probability e and a fixed codeword length n, what is the maximal achievable rate, denoted by R*(n,e). 
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Although this question is still unsolved for any finite n, the recently revisited dispersion analysis [12] gives 
the rate of convergence of R* (n, e) to the capacity. According to the dispersion analysis, for any fixed e 
and finite n the following holds: 

R*{n,e) = C-^Q-\e)+0(^^y (1) 

where Q is the standard complementary Gaussian CDF, C is the channel capacity and V is the channel dis- 
persion. The channel dispersion is given by the variance of the information density i{x; y) =\n' '^^^'^^ 



for a capacity achieving input distribution. Polyanskiy et al. showed in [2J that ([U) holds for discrete 
memoryless channels (DMCs) and for AWGN channel. In L3J the result was extended to stationary fading 
channels. 

In [IJ Ingber et al. showed that in AWGN channel without power constraint and with noise variance 
cr^, the analogy of & for IC is given by: 

r(,,.) = *--y^Q-'W + oM. (2) 

y n \ n J 

where 5*{n, e) is the optimal NLD for fixed e and finite n, and 5* = | In ( ^^^^2 ) is Poltyrev's capacity. For 
AWGN, the channel dispersion is given by V = |, which is equal to the limit of the channel dispersion 
of the power constrained AWGN, where the SNR tends to infinity. 

In this paper we extend Poltyrev's setting to the case of a fast fading channel with AWGN and without 
power constraint. The main result of this paper is that an analogous expression to © holds for fast fading 
channels. Moreover, the dispersion of unconstrained fast fading channel, derived later in the paper, equals 
the limit of the dispersion of the fast fading channel with power constraint, derived in (3], where the SNR 
tends to infinity. 

In the achievability part of the proof, we will use the Dependence Testing Bound that was used in [2] to 
prove the achievability part of ([B for DMCs. This bound is based on random coding and on a suboptimal 
decoder. The suboptimal decoder is based on information density threshold crossing. Here, we will use 
this bound for bounding the average error probability over the ensemble of codes with codewords that 
are uniformly distributed on an n-dimensional cube with length a. By letting a tend to infinity, we will 
prove the existence of an IC with NLD that is lower bounded by the right hand side (RHS) of Q. In the 
converse part of the proof, we will use the sphere packing bound for the average error probability and its 
asymptotical distribution for large n, in fast fading channels. 

The paper is organized as follows. In section |ll] notations and basic definitions for the fading channel 
model and for IC's are given. In section [III] connections to the power constrained channel model are 
discussed. In section |IV] our main result is presented and proved. In section |V] we briefly extend our main 
result to the complex channel model. Finally, we summarize the paper in section |Vll 



II. Basic Definitions 

A. Notation 

Vectors are denoted by bold-face lower case letters, e.g. x and y. Matrices are denoted by bold-face 
capital letters, e.g. H. Components of random vector x are denoted by capital letters, Xi, X2, . . . , X„. In 
the same manner, components of a random matrix H are denoted by {Hij}. Instances of random variables 
(RVs) are denoted by lower case letters, e.g. x, y and h. 



B. Channel Model 

The fast fading channel model is given by 

Yi = H,-X, + Z,,, 2 = 1,2, 



(3) 
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where, 

• {Xi} is a series of channel inputs, 

• {Hi} is a series of i.i.d. fading coefficients satisfying E{Hf} = 1, 

• {Zj} is a series of i.i.d. normal random variables, such that Zi ~ N{Q, cr^), 

• {Fj} is a series of channel outputs. 

The RVs {Xi}, {Hi} and {Z^} are independent of each other. In vector notation (for finite n) the channel 
model is given by: 

y = H-x + z, (4) 

where H = diag {Hi, H2,..., H^}. We assume a perfect CSI available at the receiver, and hence the 
receiver's channel output is the couple (y,H). 

Without loss of generality, since we have a perfect CSI at the receiver, we can assume that the fading 
coefficients are nonnegative. Moreover, we restrict the fading distribution to probability density functions 
(PDF) with zero probability to equal zero. We will denote such a fading distribution by regular fading 
distribution, which is defined formally below. 

Definition 1. (Regular fading distribution): A fading PDF f (h) is called regular fading distribution if 
there exists some positive constant a, s.t. f {h) oc ^^1^ for small enough h > 0. 

A popular statistical model for the fading channel is the Nakagami-m distribution. It is easy to verify 
that this distribution is a regular fading distribution for all m > |. 

In this paper, we investigate the dispersion of regular fading channels in finite dimensional IC with 
available CSI at the receiver. 



C. Infinite Constellations 

An infinite constellation of dimension n is any countable set of points = {si, S2, ■ ■ ■ } in 
Let Cb(a) denote an n dimensional hypercube in M'*: 



Cb 



{a)^[KeW s.t.yi\xi\<^Y (5) 

We denote by M (S, a) = |5'P|Cb(a)| the number of points in the intersection of Cb(a) and S. 
The density of points per unit volume of S is denoted by 7 and defined by 

^ M{S,a) 
7 = hm sup — . (6) 

The normalized log density of S is denoted by 5 and defined by 

<^ = -ln(7)- (V) 
n 

In the receiver, given the channel state information, the receiver's IC, denoted by -S'h, is defined by 

'S'h = {v : Src = H ■ s, s G S"} . (8) 

We also define the set H ■ Cb(a) as the multiplication of each point in Cb(a) with the matrix H. 
The density of .S'h is defined by 

7rc H = hm sup , , (9) 

a^oo Vol(H-Cb(a)) 

Mis, a) 

= lim sup , ' (10) 

„^oo det(H)-a" 

7 



det(H) <'» 
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where M{Sn,a) = |5h fl H ■ Cb(a)|. 

For G ^H, let Pg (srclH) denote the error probability when s, such that Src = H ■ s, was transmitted 
and the CSI at the receiver is H. Then, using maximum likelihood (ML) decoding the error probability 
is given by 

Pe (src|H) = Pr {src + Z^W M |H} (12) 

where W (src) is the Voronoi cell of Src, i.e. the convex polytope of the points that are closer to Src than 
to any other point G Sn- 

Definition 2. (Conditional expectation over a faded hypercube): For any function f : S'h — the 
conditional expectation of f{s^c) given H, where s^c is drawn uniformly from the code points that reside 
in the faded hypercube H • Cb(a), will be denoted and defined by 

^ ' ' s,-ce5HnH-Cb(a) 

The average error probability using ML decoding and equiprobable messages transmission is given by 

Pe (S) = E {Pe (^h)} = E \ limSUp I Yl PeiSrc\ii)\ (14) 



^ E \ limsup Es,a\H {Pe (v|H)} \ . (15) 

IIL Relation to the Power Constrained Model 

The Gallager's error exponents at rates near the capacity can be approximated by a parabola of the 
form 2 

E(P)-^^^, (16) 

where V is the channel dispersion. This fact was already known to Shannon (see Fig. 18 in [2]). By 
taking uniform input distribution, precisely X ~ ^("f't)' ^^^^ the power constrained fast fading 

channel with available CSI at the receiver, it can be shown that (fT6l) holds with C = 1 1 In j | 

and V = \ + Var (| In {H'^)), where a/ a tends to infinity (the high SNR regime). Since the unconstrained 
setting can be thought of as the limit of the power constrained setting when the SNR tends to infinity, 
this result hints that 5* = E 1 1 In ( gS^) } and = | + Var (^^ In 2^^) ) • 

In I13JI Polyanskiy et al. studied the dispersion of the general case of power constrained stationary 
fading channels. He showed that the dispersion is affected by the fading dynamics, this is in contrary 
to the channel capacity, which is independent of this dynamics [8]. Moreover, in some fading processes, 
such as Gauss-AR processes, this dispersion is increased relative to fast fading channel with the same 
marginal fading distribution. This fact can motivate the useage of random interleaver in practical systems 
with finite block-length, in order to get effectively a fast fading channel (with smaller channel dispersion). 
In case of fast fading channels with power constraint P, and AWGN variance a^, this dispersion (in nats^ 
per channel use) is given by 

V = Var (i lu (1 + SNR.H^-)) + 1 (l - E'' { .^sNR-H'' }) ' 

where SNR = P/o"^. Another indication to the channel dispersion value in the unconstrained case is 
given by taking the limit of (fTTI) when the SNR tends to infinity. In the high SNR regime (fTTI) can be 
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approximated by 



1 f\ 
r - + 1/ar f - In [SNR ■ H^) 

= \ + Var Qln(i/^)), 



(18) 
(19) 



which coincides with the previous hint to the channel dispersion value in the unconstrained setting. The 
case of stationary fading channels with memory is a subject for further research. 

It should be noted that while the dispersion analysis accuracy of ^ in case power constrained fading 
channels in Q is O {^'^^^ our analysis the accuracy is slightly better, O (^^^^f^j- 

In Fig. [T] we can see the power constrained channel dispersion rate of convergence to the unconstrained 
channel dispersion limit, with growing SNRs, at the popular Rayleigh fading channel. In Fig. [2] we can 
see the unconstrained channel dispersion for the Nakagami-m fading, for various values of m. It can be 
seen that when m — t- oo the dispersion converges to the unconstrained AWGN channel dispersion |, as 
expected. 
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Fig. 1. The power-constrained Rayleigh fading dispersion vs. the unconstrained dispersion. 
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Nakagami - M fading dispersion 



2.4 



Nakagami - M 
AWGN 




Fig. 2. The IC dispersion of Nakagami - M fading channel converges to the dispersion of the AWGN channel. 



IV. Main Result 

Theorem 1. Let t be a given, fixed, error probability. Denote by 5* {n, e) the optimal NLD for which 
there exists an n-dimensional infinite constellation with error probability at most e. Then, for any regular 
fading distribution of H, as n grows. 



5*{n,e) = 6* 



n 



n 



(20) 



where, 5{H) ^ | In (3^) and 5* = E {5{H)} = E |i In 



27160-2 



Corollary 1. The highest achievable NLD with arbitrary small error probability, over the unconstraint 
fast fading channel with available CSI at the receiver, is given by 



6* = El-\n 



2 \27vea^ J j 



(21) 



Proof: By taking the limit n — )■ 00 in (|20l) we get the desired result (for any < e < 1). ■ 
Moreover, by Jensen's inequality and the concavity of the logarithm function, we can derive the 
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following result: 



This proves that in the AWGN channel the Poltyrev's capacity is greater than its equivalent in the fast 
fading channel (with the same noise variance cr^). This loss, relative to the AWGN channel, is given 
exactly by —E {\\\{H)} in nats. Alternatively, this loss can be measured as the ratio between the highest 
noise variance that is tolerable in each channel model. It is easy to show that this ratio is given by 
^-2E{\n{H)} linear scale, or by —S.QSh^E{\\\{H)} in dB. For example, this loss equals approximately 
0.288 nats or 2.5 dB in the Rayleigh fading channel. 



A. The Sphere Packing Bound 

In this section we will prove the following sphere packing bound for any IC 5* with NLD 5: 

{S) > Pf (5) ^ Pr I llzf > e-^ (^) " } • (23) 

First, we will focus on IC where all the Voronoi cells have equal volume V. In the receiver, given 
the CSI H, we get an IC with Voronoi cell volume that equals Vrc = V ■ det(H) = VU'l^^Hi. By the 
equivalent sphere argument [iUdS], the probability that the noise leaves the Voronoi cell in the receiver is 
lower bounded by the probability to leave a sphere of the same volume: 

Pe(5)>Pr{||zf >r,VH)}, (24) 

where 

Krrff(H) ^ Kc (25) 

and 

Vn = T^. (26) 

2 \2) 

Combining ([25]), with the definition of 6 = ln{V) we get: 

Now we will extend the correctness of the bound to any IC with bounded Voronoi cells volume (regular 
IC's). 

Definition 3. (Regular IC's): An IC S is called regular if there exists a radius tq > 0, s.t. for all s E S, 
the Voronoi cell W{s) is contained in Ball(s,ro) = {x G s.t. ||x — s\\ < tq}. 

For s e S, denote by v{s) the volume of the Voronoi cell of s, and denote by (S*) the average Voronoi 
cell volume of S. Then, by definition 

V{S)^limmiEs,a{v{s)} = \imm{ 1 V v{s). (28) 

a-^oo a-s>oo ikZ O, O ^ — ^ 

^ ^ se5nCb(a) 

It is easy to verify that for any regular IC, the density is given by 7 = y^- 

Clearly, for any given H, the receiver IC is also regular. Hence, in the same manner, we can define the 
receiver's average Voronoi cell volume of Sh by 

V (Sh) = Mminf Es,a\n Msrc)} ■ (29) 

a—^oo 
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The density at the receiver is given by 7rc = viSa) ^ dctln) • 

To prove the sphere bound for regular IC's it is desirable for the clarity of the proof to denote by 
SPB (t^|H), the probability that the noise vector z leaves a sphere of volume v given the CSI H. With 
this notation, 

Pe (src|H)> SPB (t; (v) |H) = Pr i ||zf> 



Vn 



H } (30) 



for any s^^ ^ •S'h- 



Theorem 2. For any regular IC S with NLD 5, the average error probability is lower bounded by the 
following sphere packing bound 

Pe{S)>P^'{6). (31) 

Proof: By definition the average error probability is given by 

Pe (S) ^ E |limSUpE5,a|H {Pe (Src|H)}| (32) 

a— >oo J 

> E (limsupP5,a|H {SPB {v |H)}| (33) 

> E (limsup SPB (E5,a|H {v (s,c)} |H) I (34) 

= E jsPB (^lim sup Es,a\H {v M} |H^ | (35) 

= P{SPB(y(5H)|H)} (36) 



Pr<!||zf >(-^l } (37) 



V{Sn 




Pr<! ||zf >e-2M^^^^ Upf(5) (38) 



where (|33T ) follows from the sphere packing bound for each s^c ^ •S'h, (l34l) follows from Jensen's inequality 
and the convexity of the function SPB (f |H) in v and (|35] ) follows from the fact that SPB (tilH) is 
monotone decreasing and a continuous function of v. All the next steps are trivial. ■ 
In the next theorem we will extend the correctness of the sphere packing bound for any IC. This 
includes IC's with unbounded Voronoi's cells and IC's with density which oscillates with the cube size 
a (i.e. only the limsup exists in the definition of 7). The proof is based on a very similar regularization 
process as done in [1, Lemma 1] for AWGN channels. Here, in the fading channel case, we will need 
to separate from the analysis all the "strong" fading channel realizations, which are formally defined in 
the following, and use the regularization process only for the rest of the "weak" fading realizations. By 
showing that the "strong" fading realizations in regular fading distributions are an arbitrarily small fraction 
of the whole realizations space, we will complete the proof of the bound. 

Definition 4. (C, - strong fading realization): Let us denote by R = diagj/ii, . . . , a fading channel 
realization drawn from a regular fading distribution of the random fading matrix H. For a given > 0, 
let us define a fading threshold /imm(0 ^he solution of Pr{/7rnm < ^min) = where i/min — 
min(ifi, . . . , if„). If h^nm — min(/;,i, . . . , hn) < ^min(0 ^^^^ H is called a ^ - strong fading channel 
realization. 

Lemma 1. (Regularization): Given the fading channel realization H, let Su be an IC with density 7rc (H) 
and average error probability Pe {Su) = e(H). For any ^ > 0, z/H is not a ^ - strong fading realization 
then there exists a regular IC, denoted by S^, with density •y^.^ (H) > 7^ (H) / {I + ^) and average error 
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probability (^h) < e(H)(l + 0- 

Proof: See Appendix |Al ■ 

Theorem 3. For any IC S with NLD 5, the average error probability is lower bounded by the following 
sphere packing bound: 

Pe (S) > Pf (6) . (39) 

Proof: For a given H, denote the receiver IC by ^h- For any ^ > 0, by the regularization lemma, if 
H is not a ^ - strong fading realization, then there exists a regular IC, denoted by S^, with density 

1 



7,e(H)>7rc(H)/(l + 



7 



+ det(H) 



and average error probability 



Pe (^h) < Pe (Sh) (1 + 



(40) 



(41) 



where 7 = e"^. Moreover, by the ^ - strong fading definition Pr {ifmin < ^min) = ^- Following this, we 
can derive the inequalities below: 



'l+()PAS) = E[{l + OP,{Su)} 

>b{(i + OP.(s„).1{„„,.„.,^j} 

>j5ip.rs„).i{„,,.,,.j; 



> E |SPB 

> E jsPB 

= e|spb 

> E jsPB 

= e/spb 



7-Met (H)(l + 
7-Met (H)(l + 
7-Met (H)(l + 



1 - 1 



} - Pr < J 



7-Met (H)(i+o|H)}-e 



(42) 
(43) 

(44) 

(45) 

(46) 

(47) 

(48) 

(49) 



where (|45T ) follows from the regularity of 5*^, (l46l) is due to the fact that SPB(-|H) is a monotone 
decreasing function and (|48] ) is due to SPB (-IH) < 1. 
Equivalently, we get the following: 



Pe (S) > E 



SPB (^7-1 det (H) (1 + 



H 



(50) 



for all ^ > 0. Since SPB(-|H) is a continuous function we can take the limit — > (meaning implicitly 
that the "strong" fading realizations are an arbitrarily small fraction of the whole realizations space in 
regular fading distribution) and get the sphere packing lower bound: 



Pe (S) > E jsPB (^e"'"^ det (H) 



H 



Pr 



, ,|2 . -25 /'det(H) 
z > e 



P- (6) 



(51) 
(52) 



By taking the fading matrix H to be equal constantly to the identity matrix J„, the bound (|23]) coincides 
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with the sphere packing bound of the unconstrained AWGN channel, which is given byPr|||z|| >Vn "e 
Although this one dimensional integral is hard to evaluate analytically for general n, Ingber et al. derived 
in yj an easy to evaluate and very tight analytical bounds for it. These bounds coincide with the sphere 
packing bound's error exponent, derived by Poltyrev in [4], for asymptotic n. Moreover, Tarokh et al. 
represented this integral in [5 | as a sum of n/2 elements, which helps in numerical evaluation of the bound. 
In contrast, in the case of fading channel the sphere packing bound (l23l) is an n + 1 dimensional integral, 
which is extremely hard to evaluate both numerically and analytically. Nevertheless, in the asymptotic 
case, this bound can be approximated by normal distribution according to the central limit theorem, which 
helps us to prove the converse part of our main result. 

B. Proof of Converse Part 

Assume a transmission of IC S with NLD 5 over the fading channel. By the sphere packing lower 
bound of Theorem |3l 

)SB _ D„ J ii„ii2 ^ „-25 / det(H' 



> P^^ (5) = Pr { \\zr > e--^' [ ] }. (53) 

In fP] Ingber et al. proved the converse part of the dispersion analysis, in the unconstrained AWGN 
channel, by approximating the distribution of ||z||^ = XlILi ^ normal distribution using the Berry- 

Esseen lemma (see Lemma [3]) for sum of i.i.d RVs. Here, we cannot use the same analysis due to the 
fact that H is also random. By taking the logarithm and rearranging of the inequality in the argument of 
(1531) we get: 

p. > p j!!iM)_!!^ _ (,„(^.) _ E{MH)}) (54) 

1=1 

2 \ ncr^ / I n 



>V2n(E<!-ln( — ) }. (55) 



For simplicity, let us define r„ 4 Hm')-H-<^') g a where X, 4 Hm)-^iHH)} ^^^^ 



t = l,..,n) and Cn = - ^/Var{6{H))Sn to get: 



Pe>Pr{Cn>(}, (56) 



where C = In 



Although (n is a sum of n + 1 independent RVs, and despite of the existence of expansions for the 
Berry-Essen Lemma for a sum of independent RVs with varying distributions, in the standard derivation 
of these expansions it is assumed that all the RVs' variances are of the same order (see [6, pp. 542-548] 
for details). Here, Var{Yn) = 0(1) (see Lemma O and Var (^^^ = O (^). Hence, a more careful 
analysis should be done for proving that the distribution of Cn is approximately normal. The following 
three lemmas allow it. By Lemma [2] and by Lemma |3] we prove that the PDF of Yn and the CDF of 
Sn are approximately normal for large enough n, respectively. Finally by Lemma |4] we prove that the 
distribution of a sum of two independent RVs, each of which has an approximately normal distribution, 
is also approximately normal. Therefore, the distribution of Cn is also approximately normal for large 
enough n. 

Lemma 2. (Log of chi square distribution) Let Yn = where X ~ X^in). Then 
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and for asymptotically large n: 



/y„(y) = iV(0,l)+O 



ye 2 



n 



(58) 



where A^(0, 1) is the standard normal distribution's PDF. 

Proof: See Appendix El ■ 

Lemma 3. (Berry-Esseen) Let Xi, X2, ■ ■ ■ , Xn be n i.i.d. random variables with mean, variance and 
third absolute moments that equal fx = E{Xi},a'^ = Var{Xi) and = E{\Xi — p\^}, respectively, for 
i = 1, . . . ,n. If the third absolute moment exists, then for a// — 00 < s < 00 and n, 

6p3 



< 



(59) 



where Sn = and Fn{o,i){-) is the standard normal distribution's CDF. 

Proof: See Theorem 1 (Berry-Esseen for sum of i.i.d. RVs) in [6,, pp. 542]. ■ 

Lemma 4. (Sum of two almost normal RVs) Suppose that Xi and X2 are two independent random 
variables s.t. the PDF of Xi is given by 



( 



/x,(xi) = iV(0,a2) + O 



\ 



\ 



n 



N(0,al)\X2) + O [ 



and the CDF of X2 is given by 

Fx,{x2) = F, 

Let Y ^ Xi + X2, then the following holds: 

Fy(y) = Fjv(0,.g)(Z/)+O 

where CTy — crj + o"2- 

Proof: See Appendix O 
Combining Lemmas [2l [3] and |4] we get: 



(60) 



Pe>Q 




C 



i + Var{6{H)) 



O 



(61) 



By Stirling approximation for the Gamma function, Vn can be approximated as 



ln(K) 



In 



n 



27re 



n 



and hence we get: 



( = V^i E 



2 \27Tea^ J 



\n(n) + O 

2n ^ ' 



1 



5 + — ln(n) + O - 



2n 



n 



(62) 



(63) 
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The assignment of (1631 ) in (|6T1) gives us: 

e > Pe > Q — ^ ^ \-0(^ . (64) 



Taking Q ^(■) from both sides of (|64l) gives us: 
5 < E J - In 



By Taylor approximation (around e) Q^^ + O (^:;;^)) = Q^l^) + ^ (^)' ^^^^^ gives us the 
desired result: 



C. Dependence Testing Bound 

In this section we will extend Polyanskiy's Dependence Testing Bound (see Theorems 17 and 18 in 
|l2l) for the case of fast fading channel with available CSI at the receiver. In [2 J the DT bound was 
used to prove the dispersion analysis for DMC, or more precisely, for memoryless channels without a 
power constraint (or any other constraint on the channel input). Here, the channel input doesn't have any 
restriction, and hence we can use the DT bound to prove the direct part of our main result. 

Theorem 4. (DT bound) For any input distribution fx{x) on M, there exists a code with M codewords 
and an average error probability over the fast fading channel, with available CSI at the receiver, not 
exceeding 

Pe < Pr I ^(x; y, H) < In {^^\ \ + ^^Pr (^(x; y, H) > In (^^\ \ , (67) 



or equivalently, 

p^<^|e-[^(x;y,H)-in(«^)]-| 



Pr [^ (x; y, H) < In (^) } + {e-(--H)l{,.^,,H)>K^)} 



(68) 



where fxYYui^iViyi^) = fx{x) fY\x,H{y\x, h) fY\H{y\h) fnih) is the marginal joint distribution of all 
the random vectors arising above and i(x; y, H) = In ' ^^^'^'^^ 



J{x)/{y,H)^ 

Proof: The proof is based on Shannon's random coding technique and on a suboptimal decoder. For 
a given input distribution fx{x) , let us define the following deterministic function: 

(y, H) = l|,(,.y^H)>in(M_i)}- (69) 

For a given codebook C = {ci, . . . , cm}, the decoder computes the M values of (y, H) for the 
given channel output (y, H) and returns the lowest index j for which g^.. (y, H) = 1, or declares an error 
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if there is no such index. Hence, the error probability, given that x = Cj was transmitted, is given by: 
Pr I (y, H) = 0} \J{g,^ (y, H) = 1}| x = c, | < Pr |z(c,-; y, H) < In 

+ ^Pr |2(ci;y,H) > In 
i<j ^ 

(70) 

where the RHS of (TTOI) is obtained by using the union bound and the definition of y as a random vector 
which is independent of x given H and has the same distribution as y given H. 

Let us define the ensemble of the codebooks of size M, that every codeword's component in it is drawn 
independently of each other by fx{x). Averaging (fTOl ) over this ensemble and over the M equiprobable 
codewords we obtain 

Pe < Pr |^(x; y, H) < In (^^) } + E {^(^' H) > In (^^) } > (VI) 

which completes the proof of the existence of a code with M codewords whose average error probability 
is upper bounded by (|67] ). 

Now we turn to prove the equivalent bound (|68T ) of the theorem. For any positive 7 the following 
identities hold: 

E |e-W-^y'H)-l-W]^} = P {l|,(.;y,H)<In(,)} + 7e-^("^'''"^l{.(x;y,H)>in(,)}} (72) 

= Pr {i (x; y, H) < ln(7)} + 7P {e-*(''^y'«)l|i(,,y,H)>inW}} (73) 
= Pr {i (x; y, H) < ln(7)} + 7^^ { l{»(x;y,H)>in(,)} } (74) 

= Pr {i (x; y, H) < ln(7)} + 7Pr {z(x; y, H) > ln(7)} . (75) 

By taking 7 = ^^^^^ we complete the proof. 

It is important to notice that the dependence testing bound is based on a suboptimal decoder which is 
actually a threshold crossing decoder. The decoder computes M binary hypothesis tests in parallel and 
declares as the decoded codeword the first one that crosses the threshold In (^^^^). ■ 



(^)|x = c,}, 



D. Proof of Direct Part 

For the proof of the direct part, we will first construct an ensemble of finite constellation with M 
codewords uniformly distributed in an n dimensional cube Cb(a). Then, using the Dependence Testing 
bound of Theorem |4] with fx{x) = f/(— f , f ), we will find a lower bound on M for a FC whose error 
probability is upper bounded by e. We will denote this lower bound by M{n, e, a/a). Theorem |4] also 
ensures the existence of such a FC that achieves this lower bound. Finally, we will construct an IC by 
tiling that FC to the whole space M", in a way that will preserve the density of codewords and the error 
probability, asymptotically in the dimension n, as in the FC. 

For some fixed a, dimension n and error probability e, consider a code with M(n,a/a) codewords 
belonging to Cb(a). Assume that the codewords are chosen independently of each other according to the 
uniform distribution on Cb(a). The ensemble of all such codes will be denoted by J(n,a). The upper 
bound of the decoding error probability averaged over the ensemble J(n, a) is based on the Dependence 
Testing Bound of Theorem |4] Here, we use uniform input distribution fx{x) = f/(— f , f). 

To use the DT bound of Theorem HI we need to prove that for some 7 the following inequality holds: 

Pe < Pr {i (x; y, H) < ln(7)} + 7E {e-'^''-'^'^%^^,y^n)>inm } < e. (76) 
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Denote for arbitrary r 

ln(7) = nI{X; Y, H) - T^/nVar{i{X;Y, H)). (77) 
The information density is a sum of n i.i.d. RVs: 

n 

z(x;y,H) = 5^z(X,;r„iJ,), (78) 
i=i 

where i{X;Y,H) = In (^"^^^y^^j ^^'^ its moments are given by the following lemma. 

Lemma 5. (Information density's moments) If X U ^) and if the PDF of H is a regular fading 
distribution, then for large enough a/ a and for some positive constant < a < 1 the moments of the 
information density i{X; Y, H) are given by: 

1) /(X; r, H) 4 E{i{X- Y,H)] = E[\\n (^) } + O ((f)- ln(f )) 

2) Var{z{X;Y,H))='^ + Var{6{H)) + 0l{^)Un{^)) 

3) p3 = E{\i{X;Y,H) - I{X-Y,H)\^} < A + ({^) f i^^)] , for some positive and finite constant 
A. 

Proof: See Appendix |B ■ 
According to the Berry-Essen lemma (see Lemma |3]) for i.i.d. RVs, 

\Pr{z (x; y, H) < ln7} - g(r)| < (79) 



where 

B{a/a) = . (80) 

Var-2{i{X;Y,H)) 

For sufficiently large n, let 

r = Q-^(e-( ^ ^^^^^^ = + 5B{a/a)] . (81) 

1^ \y/27rVar{t{X;Y,H)) J J 

Then, from (|791 ) we obtain 

Pr (x; y, H) < ln(7)} < e - 2 ( ^=^^^^L_= + 2B{a/a)] ^. (82) 
Using Lemma |6] (see in Appendix iD]), we get 

,E {e-.^-H.l,.,..,,„,.,,„} < 2 . 2S,„/.)] 4. (83, 



n 

Summing (|82|) and (f83l) we prove the inequality (1761 ). Hence, by Theorem HI there exists a FC with 
M{n,e,a/a) codewords, denoted by S{n,e,a/a), such that 

In (M(n, e, a/a)) = ln(7) + 0(1) 

= n/(X; y, i/) - rv/TiV^ar(i(X; F, //)) + 0(1) (84) 

= n/(X; r, if) - ^JnVar{l{X■Y,H))Q-\e) + 0(1), 

where the last equality is derived by Taylor approximation for Q^^ + O (^"^j j around e. Let us define 
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the NLD of the FC in Cb(a) by 



5(n, e, a/ a) = - In . (85) 



From (|84l) we obtain 

<5(n, e, a/a) = J(X; i/) - ln(a) - ./Z^!S?iSMg-i(e) + O f - V (86) 

V n \n J 

Note that the results of Lemma |5] hold in general for large enough a. Specifically, we can choose a to be 
a monotonic increasing function of n s.t. lim„^oo a = oo, and then the results of Lemma |5] will hold for 
any large enough n. Assigning the results of Lemma |5] with appropriate choice of a = a{n), we get 



. . {1 ,„ (^) } - ^ I " "„ ^'% -(.) (87) 

+ ofi+n"lnnV (88) 



Using Taylor approximation for large enough n. 



\ + ^'"■(^(^» + « ( O * (^)) - iV^-'i^m + O (O » in (^)) , (89) 



Hence, we get 



alo) = - ,/ ^ + + o a + ^ f ^„ + f In ( - ) ) , (90) 



n \n va/ vcr/ \a/ vcr 



a 



By tiling the FC, denoted by S'(n, e, a/cr), to the whole space M" and by choosing for example a{n) = 
a ■ n^+Q , we can construct an IC (See Appendix |G] for details) with average error probability which is 
upper bounded by e and NLD 5{n, e) that satisfies 



Hence, the optimal NLD 5*{n,e) necessarily satisfies 



S'M > = S' - ./ill^lf^^Q-H.) + O f i) , (92) 

V n \n J 

which completes the proof of the direct part. 

We can observe that in the case of AWGN, namely H = 1 deterministically, our result coincides with 
the weaker achievability bound of the dispersion analysis of Ingber et al. in dH. This weaker bound is 
based on the suboptimal typicality decoder. The stronger bound in yj, which is based on the optimal 
ML decoder, is greater than the typicality bound in ^ \n(n). Hence, we conjecture that by using a ML 
decoder, instead of the suboptimal dependence testing decoder, the achievability bound is, actually, given 
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V. Extension to The Complex Model 

In this section we will extend our main result to the complex model. First, we will define the complex 
fading channel model and then we will explain its similarity to the scalar model. Finally, we will give 
the outline of the proof of the theorem in this setting. 

In the complex model, Y = H ■ X + Z where X, H and Z are independent complex RVs. Moreover, 
E {\H\'^} = 1 and Z ~ CA^(0, cr^) with i.i.d. real and imaginary components. 

Generally, H is a complex RV, but since in our model the CSI is known at the receiver, we can assume 
that H is a real and nonnegative RV, without loss of generality. Hence, the complex model is equivalent 
to the following two scalar models: 

Yr = H-Xr + Zr (94) 

Y, = H-X, + Z, (95) 
where, X = Xr + jXi, Y = Yr + jYi and Z = Zr + jZi. 

Theorem 5. Let e be a given, fixed, error probability. Denote by S*{n,e) the optimal NLD for which 
there exists an n complex-dimensional infinite constellation with error probability at most e. Then, for 
any regular fading distribution of H, as n grows, 

. _ / '^'-"fW' Q-M + O (!^) . (96) 
where. SAH) & In (^gi) e,nd 4 E {S,{H)] = B {in (£,) }. 



A. Proof outline of the direct part 

For the proof of the direct part we need to construct an ensemble of finite constellation with M 
codewords uniformly distributed in an n complex-dimensional cube Cb(a). To be more precise, each 
codeword's component (its real and imaginary parts) in the ensemble is drawn uniformly according to the 
distribution |, |) independently of each other. Then, using the Dependence Testing bound of Theorem 
|4]over this ensemble and the Berry-Essen Lemma, we can obtain the existence of EC with M{n, e,a/a) 
codewords and with an average error probability upper bounded by e, which satisfies the following: 



(97) 



e, a/a) ^ In {^ ''^''^X^''^ ) = /(X; Y, H) - Ha^) - ^^^^^l^^^i^g- (e) + O (i 
In this case the information density is given by 

^ ( f{Yr\Xr,H)f{Y,\X,,H) \ (98) 

\ f{Yr\H)f{Y,\H) 
= z{Xr;Yr,H)+i{Xf,Yi,H). 

Hence, by equivalent calculations as in Lemma [51 we can obtain 

IiX;Y,H) = EL f^\\ + oil) 



Var{i{X; F, H)) = 1 + Var ( In ( ) ) + o(l), 



(99) 
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where o(l) converges to zero when aja tends to zero. Combining (|97] ) and (|99l ) gives us the following: 

ala) = + 0(1) - y i + y°'-fe(g)) + °(V .(^) O (i) . (100) 

By tiling this FC to the whole space C" we can prove the existence of IC with an average error 
probability upper bounded by e and NLD that equals the RHS of (l96l) . This completes the proof of the 
direct part. 



B. Proof outline of the converse part 

Using the same arguments as in the scalar fading channel model, we can prove that the sphere packing 
lower bound in the complex model is given by 

Pe > Pf = Pr jllzf > e-^= (^^)'} (101) 

for any IC S with NLD 5„ where ||zf /a"^ ~ x^(2r^) and H = diag{i/i, . . . , Hn}. 

Using the same arguments as in the case of the scalar fading model, we can prove that for any n complex- 
dimensional IC, with NLD Sc and average error probability upper bounded by e over the complex fading 
channel, the following holds: 



4 < s: - 

which completes the proof of the converse part. 



i + ^"'Wi/)) ^_. 1 (ly ,,,,, 

n 2n \n J 



VI. Summary 

In this paper we derived the dispersion normal approximation for the unconstrained fast fading channel 
model with perfect CSI at the receiver. We extended the dependence testing bound, derived in []2l for 
DMCs, and the sphere packing bound, derived in [[Tl for unconstrained AWGN, to the setting of fast 
fading channels. By using these extensions (and some normal approximation techniques), we proved the 
direct and the converse part of our main result, respectively. The connection to the power constrained 
channel model was also discussed, and it was shown that the unconstrained model can be interpreted as 
the limit of the power constrained model when the SNR tends to infinity. Finally, we extended the result 
to the case of a complex fading model. 



Appendix A 
Proof of the Regularization Lemma 

Proof of LemmaU} Fix ^ > and consider the receiver's IC Su, where H is not a ^ - strong fading 
realization. First, we will find large enough a* s.t. the density of the codewords in fl H ' Cb(a*), 
and the average error probability in transmitting codewords from it, over the AWGN channel, are close 
enough to 7rc(H) and e(H). Then we will construct a regular IC by tiling this FC over the whole space 
M". For this IC the desired bounds of the lemma will hold. 
By definition we have 

Pe (Su) = Pe(5|H) = e(H) = limsup — ^ V Pe(s,c|H) (103) 

7rc H = = hm sup , , = lim sup , ■ (104) 

a->oo Vol(H ■ Cb(a)) a^oo det(H)a" 
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From the existence of the limits above there exists s.t. for every a > ao the following holds: 

Z^^mtb) S P.(..|H)<.(H)(l+^/2) (105) 

and 

M{Su,b) ^ 7rc 
b>a det(H)6" Vl+T 

Define A s.t. 

nQr^\ =|-e(H), (107) 

and define oa as the solution of 

Vol(H ■ Cb(aA + 2A)) ( + 2A\ " 



vim ru( \ I V 1 + (108) 

Vol(H ■ Cb(aA)) V / 

Define a^ax = max(ao,aA)- According to (11051 ) and (11061 ) there exists a^, > Omax s.t. 

E ^'.(VIH) < =up E P.K|H)<.(H)(l + {/2)a09) 

^ ^ s,,eSHnH-Cb(a,) 0>amax V ±1, y ^^^g^^j p H-Cb(fe) 

and 

M{Sn,a*) ^ 7rc 
det(H)< v^TT^' 

Define the FC Gh = 5h fl H ■ Cb(a^,), and denote by ^^^"(src) the decoding error probability of any 
codeword s^c € Gh in transmission over the AWGN channel. Since Gu C Sh then P^^{src) < Pe(src|H), 
and the average error probability of the FC is given by 



^e(GH) = ^ J2 ^e^«M<^ i^e(.rc|H)<6(H)(l + e/2). 



(111) 



Now, we will create a regular IC, denoted by 5*^, by tiling the FC Gu to the whole space M" in the 
following way: 

5h = {src + H ■ / ■ (a, + 2A) : s,, eGnJe Z^} , (112) 

where Z„ is the n dimensional integers lattice. 

The error probability of any Src G 5^ equals the probability of decoding by a mistake to another 
codeword from the same copy of the FC Gu or to a codeword in another copy. Hence, the average error 
probability of 5*^, with equiprobable codewords transmission over the AWGN channel, can be upper 
bounded by the union bound as follows: 

Pe[s'^) <Pe{Gn) + J2Q(^)- (113) 

1=1 V ^ / 



Since the given fading channel realization is not a ^ - strong fading realization, and from the definition 
of A we obtain: 

where h*^-^{^) is the solution of Pr{H^i^ < h*^-^} = ^. Combining (|llll) . (11131) and (11141) we obtain the 
desired result: 

Pe(5H) <e(H)(l + 0. (115) 
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The density of S'j^ is given by 

' /TTN \^ ^ M{Su,a*) ( a* Y . . 

7rcl-nj 7rc Vol(H ■ Cb(a, + 2A)) det(H)a^ ' V^* + 2Ay' ' ^ ^ 

Combining (|110|) with the definition of oa and the fact that a* > oa we obtain the desired result: 

7rc > (11^) 

Let us denote by H = diag{/ii, . . . , the given channel realization. By its construction, for any 
E Sjj, the set of points {s^c^K ■ (a* + 2A) -Cj, i = 1, . . . , n} is also in Sjj, where {ej}"^^ is the standard 
basis of M". Hence, any Voronoi cell of 5^ is contained within a sphere of radius tq = y/n{a^: +2A)/;,jnax 
centered around its codeword, where /imax — taax(hi, . . . , /i„). This proves that 5*^ is indeed a regular 
IC. ■ 



Appendix B 

Proof of the Log of Chi Square Distribution Lemma 

Proof of Lemma\2\- By simple variable substitution, we get the following relation between the CDFs 
of Yn and X: 

FyM = FxHn)ine^y). (118) 
Then, if we differentiate (II 181 ) w.r.t. y we will get the following relation between the variables' PDF: 



fvM = V2neVn?'/^.(„)(neV-^). (119) 



Assignment of the x^(n)'s PDF, /^2(„)(x) = ^\ '^^^ > will give us (|57 
From the Stirling approximation for the Gamma function for 2; G M we get 

1 



r{z + i) = zr{z) = V2^(^-) ii + oi-]] (120) 



Using (|120l) for 2; = I we get 



2 

The assignment of (11211) in (|57l) gives us 



n\ r(f + 1) /47r /n^t ^ ^1 



The assignment of the second order Taylor's approximation eV^^ = 1 + -y/fy + ^2/^ + O (^-^^ in (11221) 



gives us 

o 

J- .y 



2^ 1 + 0(1) 



Finally, by the following first order Taylor approximations e ^n^^ = 1 + O l^j and 
1 + O (i) we get m. 



20 



Appendix C 

Proof of the Sum of Two Almost Normal RVs Lemma 
Proof of Lemma^ Xi and X2 are independent. Hence, by definition, the CDF of Y is given by 

^ Pr{Y < y} 

= Pr{X, + X2<y} (^23) 
fxiix) ■ Fx^iy - x)dx. 

-00 

By the assignment of fx^ {x) and Fx2 {x) given by the lemma, we can obtain the following: 

FY{y) = r iV(0, al) ■ Fjv(o,<.|)(y - x)dx 



ry 2^ 
+ 01 / ^ ■ FN(o,al){y - x)dx 



n 



1 r ^d:: 



(124) 



_ -2 

+ 01/ dx 



By simple integration we can get 



FY{y) = I iV(0, a'i) ■ F^(o,.2)(y - x)dx + 0(4= 
FN{o,ai){y) + O ' 



(125) 



which completes the proof of (|60l ). ■ 

Appendix D 
Lemma 6 

Lemma 6. Le? Zi, Z2, . . . , be independent random variables, = J27=i be non-zero and 

T = ^"^^ E{\Zi- E{Zi}\^} < 00; then for any A 

\V2^ (T^ J a 



Proof: See Lemma 47 in [[21. ■ 

Appendix E 

The Channel Output Given CSI Distribution Lemma 

Lemma 7. Suppose that Y = H ■ X + Z, where X ~ f , f ) and Z ~ A^(0, cr^) are independent. If 
H is also a random variable independent of X and Z, then 
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Proof: 

f{y\h) = I f{y,x\h)dx 



-oo 
oo 



f{x)f{y\x,h)dx 

oo 

§ 1 1 



I a V27rcr2 
1 1 



e 2,7 



^rfx (128) 



e 2cr2 — / P 2,72 ctx 



ah y/2na'^ J \pl'no 



Tf«f-?--)-ef?-- 

ah \ \ lo o ) \lo a 
ah\^\a 2a J ^ \a 2a ) 



Appendix F 

Proof of the Information Density's Moments Lemma 
Proof of Lemma |5]- 



A. Preliminary Analysis 

The information density is given by 

f{x,y,h) 



i{x; y, h) = In 
= In 
= In 



f{x)f{y,h), 
fix)f{h)f{y\h,x) 
f{x)f{h)f{y\h) 

f{y\h,x) ' 
f{y\h) 



\j2T[a'^ J \ah \ \a 2a J \a 2a J 



2 V27rea2 J 2a'^ \ \a 2a J ^ \a 2a 

1 , / a^h^ \ z'-a' 
2^" lwJ-^^ + ^'^/'^(^'^) 



where f{y\h) is given by Lemma |7] (see Appendix El) and the following definition of 

a 2a ^ [a 2a 



ea/Ay, = - In ( Q ( ! - S V Q + ) ) • (130) 
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Define the three error's moments for z = 1, 2, 3 by 

e„/<,,^E{el/,(r,if)} (131) 
= E{E{e\f^{Y,H)\H}} (132) 
= E{ea,.^i{H)} (133) 

where 

ea,^,{h)^E{e\iSY,H)\H = h} 



oo 



2a J " y 2a J J \ \ 2a J " y 2a J 



a 



ah 
all \ a 



(134) 



and 

f^^^U r {Q(y.'^\.Q(y+'^\\^,,(Q{y.f\.Q{y + f\Uy,i= 1,2,3. 



\a I ' J-oo\ \ 2(t/ "'V 2a// V V 2(t / "'V" 2a/ 

(135) 

As can be seen in Fig. |3l the function rji (^) is nonnegative, bounded and asymptotically converges to 
a constant for h E (0, oo) and for i = 1,2, 3. The function is also monotonically nondecreasing for i = 1. 
For small values of ah/ a, we can approximate 77, (^) by 



-1)^- r Q\y)\r.' {--Q\y)]dy 
J-00 V ^ / 



-1 — / ^e-^ln* ^e-- ]dy (136) 



■I)'-Ba'(o.i) I f C(ah/a' ■ ^ 



a 



where 



By simple calculation of the moments of a standard normal random variable, we get that for small 
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values of ah /a 



Vi (—] ^ -—C{ah/a), 

( ah\ ah ( ^, , , 1 , 

r/2 — U — C^ahjof + - , (138) 



V3 ^ (^Ciah/af + lc{ah/a) - . 

It can be seen in Fig. [3] that for ah/o < 1 the approximations above are very accurate. 
First, let us calculate the first order error's moment 



ea,.,i^E{e,/„{H)] (139) 

a 

For any regular fading distribution there exists a positive constant /3 > s.t. near the origin f{h) ~ p^- 
Moreover, for any PDF there exists a positive constant /3 > s.t. /(/i) ~ f^T^y large enough h. Hence, 
for large enough a/ a, we can get the following bounds 

1) 

^^^^^"^^ (v)"^^ " ^ f{h)C{ah/a)dh)i (143) 

1 /"Tl^lnf-VM (144) 



2) 



o|/;N/;),,),o(.e)f^l a43, 



i—^dh < r f(h)^Mdh (147) 
ah \ a J J E. ah 

a 

rho i-hi r°° a 

= / f(h)—Mdh+ / f{h)—Mdh+ / f{h)—Mdh (148) 
a/i a/i J;,^ a/i 



a 



a o_ 



o\- + o - / '-\^dh + o 



(149) 

/Cr\ / /cr\ min(^,l)\ 

a) ((a) )■ 

Where (11471 ) is due to the fact that r]i{ah/a) < M for some positive and finite constant M. 
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From (|1421 ). (11461 ) and (11501 ) we get that there exists a constant < a < 1 s.t. the following holds: 



a\ /a 



e./,a=0^1nyy j. (151) 

Finally, because of the common properties of r]i{ah / a) , r]2icLh / a) and r]s{ah/a), with equivalent cal- 
culations, we can get the following error's moments: 



ea/a,3 = E{\ea/.{Y,H)f}=E 



a ( aH 



an \ a 



O In 



a\ (a 



(152) 
(153) 




Vi 

V2 
V3 

rji approx. 
r}2 approx. 
rj3 approx. 



Fig. 3. r]i{ah/a) and its approximation for small values of ah/o. 



B. Calculating the Mutual Information 

The mean of the information density is given by 

I{X;Y,H)^E{z{X;Y,H)} 

















'A" 







C. Calculating the Information Density Variance 
The variance of the information density is given by 



Var(.(X; F, H)) ^ Var (i In (^) - ^ + e./.(y, H)) 
= Var Q In [H^) + Ca/AY, H)^ 

= Var Q In (H^)^ + Var + Var (e,/.(r, H)) 

+ 2Cov Q In [H'') , e,/,(r, iJ)^ - 2Cov (^^, e„/,(y, i/) 
= 1 + yar {5{H)) + A(a/a) 



where 



A(a/a) ^ V^ar (e,/.(y, i/)) + 2Cov Q In (i/^) , e„/<,(F, //)^ - 2Cot; (^^, e,/.(F, 
= ea/.,2 - e^/^^i + E {In (if^) e,/,(F, i/)} - E {in (Z/^) } e,/<,,i 

z2 1 „ r z2 



By the Cauchy Schwarz inequality, 



E {In [E'-) ea,.{Y, H)]\ < J E {in^ (if^)} e,/,,2 = O {^^) 



and 



Combining (11631) . (11641) and (11651) we get 

A(a/a) = O (ye^) = O (in (^) (^)^) . 
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From (11621 ) and (11661 ) we get the desired result: 

Var{i{X-Y,H)) = - + Var {6{H)) + O ( In 

D. Bounding the Information Density 's Absolute third Order Moment 
The absolute third order moment of the information density is given by 

Ps^E{\z{X;Y,H)-I{X;Y,H)\'} 



(167) 



E 



< 



iln(i7^)-E{iln(i7^)} 



+ 



2(t2 



ea/a{Y,H) 



ea/a,l 



(168) 
(169) 

(170) 



where the last inequality is due to the Minkowski inequality and the definition of \\X\\^ = 
By definition we get 



ea/AY,H) = {E{el/^{Y,H)}y =e 



a/a-,3 



o I In i'^) r- 



From (|170l ) and (11711) we get the desired result 



(171) 



(172) 



for some positive and finite constant A. 



Appendix G 
Tiling 



We now turn to construct an IC with average error probability which is upper bounded by e, denoted 
by S{n, e), from the FC S{n, e', a/ a) with an error probability upper bounded by e' (using the suboptimal 
decoder on which the dependence testing bound is based). 

Define the IC S{n,e) as an infinite replication of S{n, e' , a/a) with spacing of b between every two 
copies as follows: 

S{n, e) = {s + I ■ {a + b) : s E S{n, e , a/a), I E Z„} 



where Z„ denotes the integer lattice of dimension n. The NLD of the IC is given by 



5(n, e, a/a, 6) = — In 
n 



In 



n 



M{n,e',a/a) 

{a + 6)" 
M{n,e',a/a) 



In 1 



5{ri, e , a/o") — In ( 1 H — ) 



5* 



l + Var{5{H))^_,^ , 



n 



Q-\e') + 0[- + ^ 



I ray fa 



n 



n \a 



a 



ln(^ 

-a 



where S{n, e', a/ a) is the NLD of the FC in Cb(a). 
Define the faded FC in the receiver, given the CSI, as 



S{n, e , a/a)u = {H ■ s : s E S{n, e , a/a)} 



(173) 

(174) 
(175) 
(176) 

(177) 
(178) 
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where H = diag{iJi, H2, . . . , Hn}. In the receiver, we get the following IC: 

S{n, e)H = + H ■ / ■ (a + 6) : e S{n, e', a/a)^, I E Z^,} , (179) 

which is a tiled version of the faded FC. 

Now consider the ML error probability of a point Src G S{n, e)H, given the CSI H at the receiver, 
denoted by P^^j^{src\'H.). 

If H is a too "strong" channel fading realization then we will declare an error. Formally, if i^min < ^min 
for some arbitrary positive constant /imm' where //mm — min{i/i, H2, . . . , Hn}, then we will declare an 
error. Otherwise, this error probability equals the probability of decoding by mistake to another codeword 
from the same copy of the faded FC S{n,e',a/a)u or to a codeword in another copy. Hence, by using 
the union bound, we obtain the following: 

^S.(^.c|H) < (^PfU^^m + E Q (^) ) ■ 1{H„.„>.* „J + 1{H..<.* 4 (180) 

< <^..(V|H) + nQ + (181) 

The average error probability over S{n,e)ii and H is then upper bounded by 

PeML < PfffL + nQ (^%^) + Pr {i^min < h^J . (182) 

Trivially we have 

PeML < P!St < e, (183) 

where Pfj^^ is the average error probability of the FC using the suboptimal decoder on which the 
dependence testing bound is based. 
By the union bound 

Pr {H^in < hlJ < nPr {H < h^J . (184) 
Combining (fT82l) . (fT83]) and (fT84l) we get that 

PeHiL <e' + nQ (^^) + nPr {H < 4 e. (185) 

From (11771 ) and (11851 ) we can see that for any large enough n, if we choose small enough h^^^, large 
enough b relative to h*^^^/a and large enough a relative to b, then we will get an IC with average error 
probability which is upper bounded by e and arbitrarily close to e', and NLD which equals e) = 

5*-sl&^Q-\e)^0{r^. 

Let us demonstrate this idea by an example. Suppose a regular fading distribution s.t. f{h) ~ -j^^ 
for small enough positive h and for some a > 0. Hence, Pr {H < h'^^^} = O ((/imin)")- If choose 
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/i^ijj(n) = b(n) — a ■ and a(n) — a ■ n^+a, then we will get: 



^e,ML — ^ 



< e' + ne"^ +0 f - 



n 



e' + Oi- 



n 



and 



of 77/ 1 

S{n, e, a{n)/a, b{n)) = 5{n, e', a{n)/a) — In ( 1 + 



an 



^S{n,e-0{l/n),a{n)/a) + ( - 



n 



(n ^ (a(n)) ( cr ) ^ (a(n)) ( a ) 



- ^^^^ (^) + ^ ^nj - 

Note that this operation can be done for any fixed e > (or equivalently for any e' > 0). 



(186) 



^•-Vii^M«-'(-0 (!/")) (.87, 
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